RDB-MINER: A SQL-Based Algorithm for Mining True Relational Databases

نویسنده

  • Abdallah Alashqur
چکیده

Traditionally, research in the area of frequent itemset mining has focused on mining market basket data. Several algorithms and techniques have been introduced in the literature for mining data represented in basket data format. The primary objective of these algorithms has been to improve the performance of the mining process. Unlike basket data representation, no algorithms exist for mining frequent itemsets and association rules in relational databases that are represented using the formal relational data model. Typical relational data can not be easily converted to basket data representation for the purpose of applying frequent itemset mining algorithms. Therefore, a need arises for algorithms that can directly be applied to data represented using the formal relational data model and for a conceptual framework for mining such data. This paper solves this problem by introducing an algorithm named RDB-MINER for mining frequent itemsets in relational databases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Association Rules for Predicting Customer Lifetime Value in Retail Banking Context Based on RDB- MINER Algorithm

Data mining methodology has a tremendous contribution for extracting the hidden knowledge and patterns from the existing databases. Traditionally, researchers use basket data to mine association rules of which the basic task is to find the frequent items. For relational databases whose data format is relational data other than basket data, RDB-MINER algorithm was proposed. In this paper, we int...

متن کامل

GPSQL Miner: SQL-Grammar Genetic Programming in Data Mining

The present work describes GPSQL Miner, a Genetic Programming system for mining relational databases. This system uses Grammar Genetic Programming for classification’s task and one of its main features is the representation of the classifiers. The system uses SQL grammar, which facilitates the evaluation process, once the data are in relational databases. The tool was tested with some databases...

متن کامل

Empirical Analysis on Comparing the Performance of Alpha Miner Algorithm in SQL Query Language and NoSQL Column-Oriented Databases Using Apache Phoenix

Process-Aware Information Systems (PAIS) is an IT system that support business processes and generate large amounts of event logs from the execution of business processes. An event log is represented as a tuple of CaseID, Timestamp, Activity and Actor. Process Mining is a new and emerging field that aims at analyzing the event logs to discover, enhance and improve business processes and check c...

متن کامل

Integrating RDMS and Data Mining capabilities using

Mining information from large databases has been recognized as a key research topic in database systems. The explosive growth of databases has made neccesary to discover techniques and tools to transform the huge amount of stored data, into useful information. Rough Set Theory 18] has been applied since its very beginning to diierent application areas. This chapter presents an integration of Re...

متن کامل

SQL Based Association Rule Mining using Commercial RDBMS (IBM DB2 UDB EEE)

Data mining is becoming increasingly important since the size of databases grows even larger and the need to explore hidden rules from the databases becomes widely recognized. Currently database systems are dominated by relational database and the ability to perform data mining using standard SQL queries will definitely ease implementation of data mining. However the performance of SQL based da...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JSW

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2010